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Abstract 

Polynomial algorithms are given for the following two problems: 

• given a graph with n vertices and m edges, where m > 3n 3 / 2 , find a complete 
balanced bipartite subgraph with parts about ln (^" m ) ; 

• given a graph with n vertices, find a decomposition of its edges into complete 
balanced bipartite graphs having altogether 0(n 2 / Inn) vertices. 

Previous proofs of the existence of such objects, due to K6vari-S6s-Turan [TO] , 
Chung-Erdos-Spencer [5], Bublitz [4j and Tuza [13] were non-constructive. 



1 Introduction 

Determining the minimal number of edges in a bipartite graph which guarantees the 
existence of a complete balanced bipartite subgraph K q>q is known as the Zarankiewicz 
problem (see, e.g., Bollobas |3J). It was shown by Kovari, Sos and Turan [TU] that every 
bipartite graph with n vertices in both sides and c q n 2 ~ l l q edges contains a K qq . The 
same bound (with different constant c q ) holds for general n-vertex graphs. The argument 
from [10] also shows that n-vertex graphs of constant density, i.e., graphs with en 2 edges, 
contain a complete bipartite graph with parts of size at least c e lnn. The proofs of all 
these results are based on counting, and thus are non-constructive. 
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We consider the question whether such subgraphs can be found by efficient, i.e., polyno- 
mial time, algorithms. This question has been considered recently by Kirchner [9], who 
gave an efficient algorithm to find a complete balanced bipartite subgraph with parts of 
size Sl(vkui) in graphs of constant density. We improve this result by giving an efficient 
algorithm which finds a complete balanced bipartite subgraph with parts of size fi(lnn), 
i.e. of the optimal order of magnitude, in graphs of constant density. Our algorithm gives 
subgraphs of similar size as the counting argument in other ranges as well □. 

Finding a largest balanced complete bipartite subgraph is an important optimization 
problem, which is known to be NP-hard, and even hard to approximate (see, e.g., Feige and 
Kogan [6]). We would like to emphasize that we are not trying to give an approximation 
algorithm for this problem. Our objective is to give an efficient algorithm which finds a 
balanced complete bipartite subgraph of size close to the largest size that is guaranteed to 
exist knowing only the number of edges in the graph. Thus, even in a dense graph, we are 
finding a subgraph of logarithmic size only. Results of this type are given, for example, 
in Alon et al. pQ. 

The counting argument of |10j has several applications to other combinatorial problems. 
It seems to be an interesting question whether the algorithmic version of the counting 
argument leads to further algorithmic results in these applications. As a case in point, 
we consider the question of decomposing, or partitioning, the edge set of a graph into 
complete bipartite graphs. The motivation to look for such algorithms comes from an 
application in approximation algorithms [2]. 

Every n- vertex graph can be decomposed into at most n — 1 stars, and Graham and Pollak 
[7j showed that n — 1 complete bipartite graphs are necessary for the n-vertex complete 
graph. Instead of minimizing the number of complete bipartite graphs in a decomposition, 
one can also try to minimize the complexity of decompositions, measured by the sum of 
the number of vertices of the complete bipartite graphs used in the decomposition. This 
measure of complexity was suggested by Tarjan [12] in the context of circuit complexity. 
For recent connections to circuit complexity see Jukna [8]. 

It was shown by Chung, Erdos and Spencer [5], and by Bublitz [I], that there is always 
a decomposition of complexity 0(n 2 / Inn), and this order of magnitude is best possible. 
Similar results were obtained by Tuza [13] for decomposing bipartite graphs. These results 
are obtained by repeatedly applying the counting argument to show the existence of a 

1 Note that the problem becomes meaningless in the sense studied here for fewer than n 3 / 2 edges, as 
such graphs do not always contain even Ki^. subgraphs. 
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large complete bipartite graph and removing its edges. Thus the decomposition results 
obtained in [H [5l [13] are also non-constructive. As a direct application of our algorithm 
for finding bipartite subgraphs, we obtain efficient algorithms to find decompositions of 
complexity 0(n 2 / Inn). 

2 Complete balanced bipartite subgraphs 

Searching for a K qA by checking all subgraphs of that size would give an algorithm with 
superpolynomial running time if q is, say, logarithmic in the number of vertices. A 
polynomial algorithm could be given by restricting the search space to a polynomial size 
set of candidate subgraphs. One possibility for that would be to find a bipartite subgraph 
(R, S) with the following properties: 

• it is dense enough for the known results to guarantee the existence of a K q ^ q , and 

• the number of g-element subsets of R is only polynomial. 

If such an (R, S) can be found efficiently then a required K qA is obtained by checking 
the common neighborhood of all g-element subsets of R. It turns out that this approach 
indeed works if one chooses R to be the right number of vertices with maximal degree and 
S to be the remaining vertices. Thus, we consider the following algorithm, where q(n, m) 
and r(n, m) are functions to be determined. 

Algorithm FIND-BIPARTITE 

input: G = (V, E) with \V\ = n and \E\ = m 

q := q(n,m), r := r(n,m), 
R := r vertices having highest degree 
for all subsets C C R with |C| = q do 
D := f]{N(v)-R : v G C} 

if |D| > q then D' := the first q elements of D, return (C, D') 
We now show that with the appropriate choice of q(n, m) and r(n, m) the algorithm works. 
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Theorem 1 Let 



q := 



ln(n/2) 




qn 2 


ln(2en 2 /m)_ 


, r : = 


m 



If n is sufficiently large and m > 3n 3 / 2 then Algorithm FIND-BIPARTITE returns a 
K qq (with q > 2 as long as m> 8n 3 / 2 ). The running time of the algorithm is polynomial 
in n. 

Remark. Note that our algorithm finds a K qA in an n-vertex graph with m = c q n 2 ~ x l q 
edges as long as c q is large. This is optimal for q = 2, 3 as there exist n-vertex graphs 
with c' q n 2 ~ l / q edges and no K q>q , and if certain conjectures in extremal graph theory are 
true, then it is also optimal for q > 3. 

Proof. After selecting i < r vertices, the number of edges incident to these vertices is 
less than rn. Hence in the subgraph induced by the remaining vertices there is a vertex 
of degree at least 2(m — rn)/n. Thus if R is the set of r highest degree vertices in G then 

E, \ 2r(m — rn) 
n 

veR 

Hence the bipartite graph H with parts R,V — R and edge set comprising those edges of 
G with one endpoint in R and the other in V — R has at least 2rm/n — 3r 2 edges. 

We will now argue that rm/n > 3r 2 . Indeed, rm/n > 3r 2 is equivalent to r < m/3n. Now 
r < qn 2 /m so it is enough to show that qn 2 /m < m/3n or equivalently, that 3qn 3 < m 2 . 
Using the definition of q, we see that 3qn 3 < m 2 follows from 

m 2 ln(2en 2 /m) > 3n 3 ln(n/2). 

Suppose first that 3n 3 ^ 2 < m < 3n 3 / 2 Vlnn. Then 

m 2 ln (^-] > 9n 3 ln ( \ > g n 3 ln f L^_\ > 4 n ^\ nn > 3n 3 ln(n/2). 

On the other hand, if m > 3n 3 / 2 vhin, then using m < n 2 /2 we have 

m 2 ln(2en 2 /m) > 9n 3 lnnln(2en 2 /m) > 9n 3 lnnln(4e) > 3n 3 ln(n/2). 
We conclude that H has at least 2rm/n — 3r 2 > rm/n edges. 
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For the correctness of the algorithm it is sufficient to show that H contains a copy of K q>q . 
This follows by the counting argument referred to in the introduction, which is included 
here for completeness. Let s denote the number of stars with centers in V — R and q 
leaves. Then 

EfdeqH{v)\ nfrm/n' 
( V ) * ' 

v&V-R v 



q J 2 V q 



using the convexity of the function which is y^j if x > q — 1 and otherwise, and using 
r < n/2 which follows by the lower bound on m. If the latter quantity is greater than 
(q — 1) (p then there is a g-subset of R which is the leaf set for at least q distinct stars, and 
this gives a copy of K qA . Observe that the definition of q implies that n/2 > (2en 2 /m) q 
and this is equivalent to 

n ( rm \ 9 ( 2er\ 9 



2 \n 2 qj \ q J 



Now the inequality above and standard estimates of the binomial coefficients give 
n ( rm/ n 2 \ n ( rm \ q ( 2er \ q ( re \ q , . (r~ 



Thus H indeed contains a K q q . 



In order to show that the running time of the algorithm is polynomial, note first that, 
assuming an adjacency matrix representation, the set R can be found in 0(n 2 ) steps. For 
a given g-subset of R, the common neighbors can be found in 0{nq) steps. All g-subsets 
can be listed in O(Q) steps (see, e.g. [TT]). Thus the algorithm requires time 



2 ' r 



\ n + \ q ) n 1 



The number of iterations is at most 



q) V q J 



q < g lnn/ln4e _ l/ln4e , ,,0. U!ir, 



Now m < n 2 /2 implies that 
and q < Inn/ ln(n 2 /m) implies that 

e q\a{n 2 /m) < g lnn _ 

Therefore the running time of the algorithm is 0{n 2A2 ). □ 
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3 Decomposition into balanced complete bipartite sub- 
graphs 



Given a graph G = (V, E), we consider complete bipartite subgraphs Gi = (A4, Bi, Ei), i = 
l,...,t such that the edges sets Ei form a partition of E. The complexity of such a 
decomposition is measured by the total number of vertices, i.e., by 



We find a decomposition of complexity 0(n 2 / Inn). The decomposition contains balanced 
bipartite graphs, thus \Ai\ = \Bi\ holds as well. The algorithm uses Algorithm FIND- 
BIPARTITE in a straightforward manner. As stated, Algorithm FIND-BIPARTITE 
is guaranteed to work only if n > no for some no- As we are only interested in proving 
an asymptotic result, let us assume that graphs on fewer vertices are handled by some 
brute-force method. 

Algorithm FIND-DECOMPOSITION 

Given an n- vertex input graph G = (V, E) , if n < n , use a brute-force 
method to find an optimal decomposition of G. Else, use Algorithm FIND- 
BIPARTITE repeatedly to find a complete balanced bipartite subgraph and 
delete it from the current graph, as long as there are more than n 2 / Inn edges. 
After that, form a separate bipartite graph from each remaining edge. 

Theorem 2 For every n-vertex graph G, Algorithm FIND-DECOMPOSITION finds 
a decomposition of G into balanced complete bipartite graphs, having complexity 



The running time of the algorithm is polynomial in n. 

Proof. As the size of the subgraphs produced by Algorithm FIND-BIPARTITE is 

of the same order of magnitude as guaranteed by the existence theorems, the theorem 
follows as in [HO [13]. For completeness, we give the argument, following [T3] . 



t 
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Let the subgraphs produced by the calls of Algorithm FIND-BIPARTITE be = 
(Ai, Bj) with |v4j| = \Bj\ — g,, where i — 1, . . . , t for some t . We need to show that 



Let us divide the iterations of the algorithm into phases. The tih phase consists of 
those iterations where the number of edges in the input graph of Algorithm FIND- 
BIPARTITE is more than n 2 /(£ + 1) and at most n 2 /£. Dividing up the term q% in (1T1) 
between the q 2 edges of G iy each edge gets a weight of We have to upper bound the 
sum of the weights assigned to the edges. 

It follows from the definition of in Theorem [T] that graphs formed in the i'ih phase 
have qi = 0(Lm/ln£). Thus edges, which get their weight in the £'th phase, get a 
weight of @(ln£/lnn). The number of edges getting their weight in the £'th phase is 
B((| — j^i)n 2 ) = Q(n 2 /£ 2 ). Hence the total weight assigned to the edges is at most of 
the order of magnitude 



as is convergent. The polynomiality of the running time follows directly from the 

polynomial running time of Algorithm FIND-BIPARTITE. □ 

4 Subgraphs and decompositions of bipartite graphs 

In this section we formulate the result analogous to Theorem [2] for bipartite graphs G = 
(A,B,E) having parts A and B, with \A\ = a, \B\ = b and \E\ = m. We assume w.l.o.g. 
that a > b. 

The algorithms and their analysis are straightforward modifications of those for gen- 
eral graphs. Algorithm FIND-BIPARTITE-IN-BIPARTITE, a modified version of 
FIND-BIPARTITE, uses functions q(a,b,m) and r(a,b,m). It constructs R as the set 
of r highest degree vertices in B, and checks the common neighborhood of all q element 
subsets of R. Algorithm FIND-DECOMPOSITION-IN-BIPARTITE, a modified 
version of FIND-DECOMPOSITION, uses this modified algorithm while the number 
of edges is greater than abj ln(a + b). 

Theorem 3 Let G be a bipartite graph with sides of size a and b. Algorithm FIND- 
DECOMPOSITION-IN-BIPARTITE finds a decomposition of G into balanced com- 




(1) 
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plete bipartite graphs, having complexity 

o ■ ( ' h 



\n(a + 6) 

The running time of the algorithm is polynomial in a + b. 
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